This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 



BLACK BORDERS 

TEXT CUT OFF AT TOP, BOTTOM OR SIDES 
FADED TEXT 
ILLEGIBLE TEXT 
SKEWED/SLANTED IMAGES 
COLORED PHOTOS 

BLACK OR VERY BLACK AND WHITE DARK PHOTOS 
GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



^3 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
27 December 2002 (27.12.2002) 




PCT 



(10) International Publication Number 

WO 02/103026 A2 



(51) iDternational Patent Classification^: C12P 21/00 

(21) International Application Number: PCT/EP02/0S7S2 

(22) International Filing Date: 27 May 2002 (27.05.2002) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 
01114497.9 



15 June 2001 (15.06.2001) BP 



(71) Applicant: F. HOFFMANN-LA ROCHE AG [CH/CH]; 
Grenzacherstrasse 124, CH-4070 Basel (CH). 

(72) Inventors: HOESS, Eva; Zitzelsbei^gerstrasse 13a, 81476 
Muenchen (DE). MEIER, Thomas; Klingerstrasse 25a, 
81369 Muenchen (DE). PESTLIN, Gabriele; Habsburger 
Platz 6. 80801 Muenchen (DE). POPP, Friedrich; Flo- 
hbuhlweg 7, 82377 Penzberg (DE). REICHERT, KUus; 
Heinrichstrasse 24. 82362 Weiheim (DE). SCHMUCK, 
Rainer; Am Klosterweiher 7, 83671 Benediletbeuem (DE). 
SCHNEIDINGER, Bemd; Haarkiichener Strasse, 82069 
Hohenschaeftlam/Neufahm (DE). SEIDEL, Christoph; 
Ammersttasse 39, 82362 Weilheim (DE). TISCHER, 
Wilhelm; Finkenweg 5, 82380 Peissenberg (DE). 



(74) Agent: RAUBER, Beat; Grenzachentrasse 124, CH-4070 
Basel (CH). 

(81) Designated States (national): AE, AG. AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, H, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, PH. PL, PT, RO, RU. SD, SE. SG, SI. 
SK, SL, TJ, TM, TR, TT. TZ. UA. UG. UZ, VN, YU. ZA. 
ZW. 

(84) Designated States (regional)'. ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ. BY. KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAPI patent 
(BF, BJ, CF, CG. CI. CM, GA, GN, GQ, GW, ML, MR. 
NE. SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes an Codes and Abbreviations " appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



vo 

O (54) Title: ACETYLATION OF GP41 FRAGMENTS 

(57) Abstract: A process for the production of an antifusogenic peptide of a length of about 10 to 50 amino acids in a prokaryotic 
— « host cell, characterized in that, under such conditions that inclusion bodies of said non-fusion antifusogenic peptide or said fusion 

peptide arc formed, a) in said host cell there is expressed a nucleic acid encoding said antifusogenic peptide as a non-fusion peptide 

or encoding a fusion peptide of a length of about 14 to 70 amino acids consisting of said antifusogenic peptide N-terminally linked 
^ to a further peptide of a length of about 4 to 30 amino acids; b) said host cell is cultivated; c) said inclusion bodies are recovered 
^ and solubilized; d) in the case of said fusion peptide said antifusogenic peptide is cleaved ofif from said further peptide; and e) said 

antifusogenic peptide is isolated. 
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Acetylation of qp41 fragments 

The invention relates to methods for the recombinant production of peptides which inhibit 
the fusion of viruses with membranes of target cells. In particular, this invention relates to 
the recombinant production of peptidic inhibitors of lentivirus such as human immuno- 
deficiency virus (HIV), Simian immunodeficiency virus (SIV), measles virus (MEV), 
5 influenza viruses such as respiratory syncytical virus (RSV) or human parainfluenza virus 
(HPV). 

Ragkground of the Invention 

Fusion of viruses with cellular membranes is an essential step for the entry of enveloped 
viruses, such as HIV-I, HIV-II, RSV, measles virus, influenza virus, parainfluenza virus, 
10 Epstein-Barr virus and hepatitis virus, into cells. After having entered the cell the cascade of 
viral replication may be initiated resulting in viral infection. 

HIV is a member of the lentivirus genus, which includes retroviruses that possess complex 
genomes and exhibit cone-shaped capsid core partides. Other examples of lentiviruses 
include the simian immunodeficiency virus (SIV), visna virus, and equine infectious anemia 
15 virus (EIAV). Like all retroviruses, HIVs genome is encoded by RNA, which is reverse- 
transcribed to viral DNA by the viral reverse transcriptase (RT) upon entering a new host 
cell Influenza viruses and their cell entry mechanisms are described by Buliough, P.A., et 
ah, Nature 371 (1994) 37-43; Carr, CM., and Kim, P.S., Cell 73 (1993) 823-832j and 
Wilson, I.A., et al.. Nature 289 (1981) 366-373. 

20 All lentiviruses are enveloped by a lipid bilayer that is derived firom the membrane of the 
host cell. Exposed surface glycoproteins (SU, gpl20) are anchored to the virus via interac- 
tions with the transmembrane protein (TM, gp41). The lipid bilayer also contains several 
cellular membrane proteins derived from the host cell, including major histocompatibility 
antigens, actin and ubiquitin (Arthur, L.O., et al, Science 258 (1992) 1935-1938). A matrix 

25 shell comprising approximately 2000 copies of the matrix protein (MA, pl7) lines the inner 
surface of the viral membrane, and a conical capsid core particle comprising ca. 2000 copies 
of the capsid protein (CA, p24) is located in the center of the virus. The capsid particle 
encapsidates two copies of the unspliced viral genome, which is stabilized as a ribonucleo- 
protein complex with ca. 2000 copies of the nucleocapsid protein (NC, p7), and also 

30 contains three essential virally encoded enzymes: protease (PR), reverse transcripUse (RT) 
and integrase (IN). Virus particles also package the accessory proteins, Nef, Vif and Vpr. 
Three additional accessory proteins that function in the host cell. Rev, Tat and Vpu, do not 
appear to be packaged. 
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In the case of HIV, viral entry is associated with the HIV envelope surface glycoproteins 
(Lawless, M.K., et al., Biochemistry 35 (1996) 13697-13708; and Turner, B.G., and 
Summers, M.F., J. Mol. Biol. 285 (1999) 1-32). In the case of HIV-I, this surface protein is 
synthesized as a single 160 kD precursor protein, which is cleaved by a cellular protease into 
5 two glycoproteins gp-41 and gp-120. gp-41 is a transmembrane protein, and gp-120 is an 
extracellular protein_which remains non-covalently associated with gp-41 in a trimeric or 
multimeric form (Hammarskjold, M.-L., et al, Biochim. Biophys. Acta 989 (1989) 269- 
280). HIV is targeted to CD4* lymphocytes because the CD4 surface protein acts as the 
cellular receptor for the HIV-I virus. Viral entry into cells is dependent upon gp-120 
10 binding to the cellular CD4^ receptor molecules while gp-41 anchors the envelope glyco- 
protein complex in the viral membrane and mediates membrane fusion (McDougal, J.S., et 
al., Science 231 (1986) 382-385; and Maddon, PJ., et al. Cell 47 (1986) 333-348). 

gp41 is the transmembrane subunit that m^ediates fusion of viral and cellular membranes. 

The gp41 ectodomain core is a six-helix bundle composed of three helical hairpins, each 
15 consisting of an N helix paired with an antiparallel C helix (Chan, D.C., et al.. Cell 89 

(1997) 263-273; Weissenhorn, W., et al, Nature 387 (1997) 426-430; Tan, K., et al, Proc. 

Natl Acad. Sci. USA 94 (1997) 12303-12308). The N helices form an interior, trimeric 

coiled coil with three conserved, hydrophobic grooves; a C helix packs into each of these 

grooves. This structure likely corresponds to the core of the fusion-active state of gp41. 
20 According to Chan, D.C., et al, Proc. Nad. Acad. Sci. USA 95 (1998) 15613-15617, there is 

evidence that a prominent cavity in the coiled coil of the HIV type 1 gp41 is an attractive 

drug target. 

It is assumed that the mechanism by which gp-41 mediates membrane fusion may involve 
the formation of a coiled-coil trimer, which is thought to drive the transition from resting 
25 to fiisogenic states, as is described, for example, for influenza hemagglutinin (Wilson, I. A., 
et al, Nature 289 (1981) 366-373; Carr, CM., and Kim, P.S.. Cell 73 (1993) 823-832; 
BuUough, P.A., et al. Nature 371 (1994) 37-43). - 

C peptides (peptides corresponding to the C helix) of enveloped viruses, such as DP 178 and 
C34, potently inhibit membrane fusion by both laboratory-adapted strains and primary 
30 isolates of HIV-1 (Malashkevich, V.N., et al, Proc. Natl. Acad. Sci. USA 95 (1998) 9134- 
9139; Wild, C.T., et al, Proc. Natl. Acad. Sci. USA 91 (1994) 9770-9774). A Phase I clinical 
trial with the C peptide DP 178 suggests that it has antiviral activity in vivo, resulting in 
reduced viral loads (Kilby, J.M., et al, Nature Medicine 4 (1998) 1302-1307). The structu- 
ral features of the gp41 core suggest that these peptides act through a dominant- negative 



wo 02/103026 



-3- 



PCT/EP02/05782 



mechanism, in which C peptides bind to the central coiled coil of gp41 and lead to its inac- 
tivation (Chan. D.C., et al., Cell 93 ( 1998) 681-684). 

Within each coiled-coil interfece is a deep cavity, formed by a cluster of residues in the N 
helix coiled coil, that has been proposed to be an attractive target for the development of 

5 antiviral compounds. Three residues from the C helix (Trp-628, Trp-631, and Ile-635) 
insert into this cavity and make extensive hydrophobic contacts. Mutational analysis indi- 
cates that two of the N-helix residues (Leu-568 and Trp-571) comprising this cavity are 
critical for membrane fusion activity (Cao, J., et ah, J. Virol. 67 (1993) 2747-2755). There- 
fore, compounds that bind with high affinity to this cavity and prevent normal N and C 

10 helix pairing may be effective HIV-1 inhibitors. The residues in the cavity are highly 
conserved among diverse HIV-1 isolates. Moreover, a C peptide containing the cavity- 
binding region is much less susceptible to the evolution of resistant virus than DP 178, 
which lacks this region (Rimsky, L.T., et al., J. Virol. 72 (1998) 986-993). These observa- 
tions suggest that high-affinity ligands targeting the highly conserved coiled-coil surface, 

15 particularly its cavity, will have broad activity against diverse HIV isolates and are less likely 
to be bypassed by drug-escape mutants. 

Fusogenic structures of envelope fusion proteins was shown from influenza, Moloney 
murine leukemia virus, and simian immunodeficiency virus (cit. in Chan, D.C., Proc. Natl. 
Acad. Sci. USA 95 (1998) 15613-15617), human respiratory syncytial virus, Ebola, human 
20 T cell leukemia virus, simian parainfluenza. It indicates a close relationship between the 
families of orthomyxoviridae, paramyxoviridae, retroviridae, and others like filoviridae, in 
which viral entry into target cells is enabled by like transmembrane glycoproteins such as 
gp41 of HIV-1, hemagglutinin of influenza, GP2 of Ebola and others (Zhao, X., et al, Proc. 
Natl. Acad. Sci. USA 97 (2000) 14172-14177). 

25 In the state of the art, methods are described for the preparation of peptidic inhibitors 
(C-peptides) (see, e.g., Root, M.J., et al.. Science 291 (2001) 884-888; Root et al. describe 
peptide C37-H6 which is derived from HIV-1. HXB2 and contains residues 625-661. It was 
recombinantly expressed as N40-segement with a GGR-linker and a histidine tag, expressed 
in E.coli and purified from the soluble fraction of bacterial lysates. Zhao, X., et al. describe 

30 in Proc. Natl. Acad. Sci. USA 97 (2000) 14172-14177 a synthetic gene of recRSV-l (human 
respiratory syncytial virus) which encodes Residues 153-209, a G-rich linker, residues 476- 
524, Factor Xa cleavage site and a his-tag. Chen, C.H., et al., describe in J. Virol. 67 (1995) 
3771-3777 the recombinant expression of the extracellular domain of gp41 synthesized as 
fusion protein, residues 540- 686, fusion to MBP. 
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<A number of peptidic inhibitors, also designated as antifusogenic peptides, of such 
membrane fusion-associated events are known, including* for example, inhibiting retrovi- 
ral transmission to uninfected cells. Such peptides are described, for example, by Lambert, 
D.M., et al, Proc. Natl. Acad Sci. USA 93 (1996) 2186-2191, in U.S. Patent Nos. 6.013,263; 
5 6.017,536; and 6,020,459; and in WO 00/69902. WO 99/59615 and WO 96/40191. Further 
peptides inhibiting fusing associated events are described, for example, in U.S. Patent Nos. 
6,093.794; 6,060,065; 6,020,459; 6,017,536; 6,013,263; 5,464.933; 5,656,480; and in 
WO 96/19495. 

Examples of linear peptides derived from the HIV-I gp*4I ectodomain which inhibit viral 
10 fusion are DP- 107 and DP- 178. DP- 107 is a portion of gp-41 near the N-terminal fusion 
peptide and has been shown to be helical, and it strongly oligomerizes in a manner consis- 
tent with coiled-coil formation (Gallaher, W.R., et al, Aids Res. Hum. Retrovirus 5 (1989) 
431-440, Weissenhorn, W., et al, Nature 3«7 (1997) 426-430). DP-178 is derived from the 
C-terminal region of the gp-41 ecto-domain. (Weissenhorn, W., et al.. Nature 387 (1997) 
15 426-430). Although without discernible structure in solution this peptide and constrained 
analogs therefrom adopt a helical structure, bind to a groove of the N-terminal coiled-coil 
trimer of gp41 and thus prevent the gp41 to transform into the fusogenic state (Judice, J. 
K., et al., Proc. Natl. Acad. Sci. USA 94 (1997) 13426-13430). 

Such short-chain peptides usually are prepared by chemical synthesis. Chemical synthesis is 
20 described, for example, by Mergler, M.. et al., Tetrahedron Letters 29 (1988) 4005-4008 and - 
4009-4012; Andersson, L, et al.. Biopolymers 55 (2000) 227-250; and by Jones, J.H., J. Pept. 
Sci. 6 (2000) 201-207. Further methods are described in WO 99/48513. 

However, chemical peptide synthesis suffers from several drawbacks. Most important is 
racemization, which results in insufficient optical purity. In peptide chemistry, racemiza- 
25 tion also means epimerization at one of several chirality centers. If only 1% racemization 
occurs for a single coupling step, then at 100 coupling steps only 61% of the target peptide^ 
would be received (Jakubke, H.D., Peptide, Spektrum Akad. Verlag, Heidelberg (1996), p. 
198). It is obvious that the number of impurities increases with growing chain length and 
their removal is more and more difficult and costly. 

30 Chemical synthesis on large scale is limited by high costs and lack of availability of 
protected amino acid derivatives as starting materials. On the one hand, these starting 
materials should be used in excess to enable complete reactions, on the other hand, their 
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use should be balanced for cost reasons, safety and environmental aspects (Andersson et al., 
Biopolymers 55 (2000) 227-250). 

Peptides may also be produced by recombinant DNA technology. Whereas recombinant 
production of soluble proteins of chain lengths of more than 50 amino acids is known from 

5 the state of the art, the production of peptides with fewer than 50 amino acids suffers from 
several drawbacks (Doebeli, H., et al, Protein Expression and Purification 12 (1998) 404- 
414). Such short or medium chain peptides are usually not stably expressed. They are 
attacked by intrinsic peptidases and degraded. This may result from their small size or lack 
of highly ordered tertiary structure (WO 00/31279). Other authors have found that 

10 recombinant production of peptides requires a minimum chain length of 60 to 80 amino 
acids for a stable expression, and it is further common knowledge that such peptides are 
produced as soluble peptides and not as inclusion bodies (see, e.g., van Heeke, G., et al, 
Methods in Molecular Biology 36 (1994) 245-260, eds. B.M. Dunn and M.W. Pennington, 
Humana Press Inc., Totowa, NJ.); and Goldberg et al. Maximizing Gene Expression 

15 (1986), pp. 287-314, eds. Reznikoff, W., and Gold, L, Butterworks, Stoneham, MA). 
Recombinant production of shorter peptides is especially not successful because if such 
peptides are expressed in prokaryotes, they remain soluble and are immediately degraded 
by prokaryotic peptidases. To avoid this problem, according to common knowledge, such 
peptides are expressed as large (more than 150 to 200 amino acids) fusion proteins, 

20 whereby the fusion tail either renders the ftision protein fairly soluble and avoids the 
formation of inclusion bodies or the fusion tail is a protein which forms during 
recombinant expression in prokaryotes, inclusion bodies, and therefore fiision protein 
consisting of such fiision tail and the desired short peptide will also form inclusion bodies 
during overexpression in prokaryotes. A great disadvantage of such methods is that the 

25 molecular weight of the fiision tail is considerably higher than the molecular weight of the 
desired peptide. Therefore, the yield of the desired peptide is very low and the excess of 
cleaved fiision tail has to be separated of. 

Lepage, P., et al., in Analytical Biochemistry 213 (1993) 40-48. describe recombinant 
methods for the production of HlV-lRev peptides. The peptides are expressed as fijsion 

30 proteins with the synthetic immunoglobulin type G (IgG) binding domains of Staphylo- 
coccus aureus protein A. The peptides have a length of about 20 amino acids, whereas the 
IgG-binding part has a length of about 170 amino acids, so that the expressed fiision 
protein has an overall length of about 190 amino acids. This fusion protein is expressed, 
secreted in soluble form in the medium, and purified by affinity chromatography. The 

35 authors reported that with this method it might be possible to produce recombinant 
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protein in an amount of hundreds of milligrams per liter of culture. However, this 
methodology is limited due to alternative processing within the signal peptide sequence 
and several post-translational modifications of the fused proteins as well as of the cleaved 
peptides. Assuming an average molecular weight of an amino acid of llODaltons, the 
5 desired peptides have a molecular weight of about 2,000 to 5>000 Daltons, whereas the 
fusion tail has a length of at least 170 amino acids (about 19,000 D), if the IgG binding 
domains of Staphylococcus aureus protein A is used as such a fusion tail. Therefore, only 10 
to 25% of the recombinantly produced protein is the desired peptide. 

EP 0 673 948 describes the recombinant production of a gp41 peptide as a fusion protein 
10 with S-galactosidase using the expression vector pSEM3 (Knapp, S., et al., BioTechniques 8 
(1990) 280-281). This fusion protein contains a large part of 6-galactosidase gene, encoding 
the N-terminal 375 amino acids and additional 23 codons of a polylinker sequence. 

Further examples and methods for the recombinant production of small peptides via large 
fusion proteins in £.coli are described by Uhlen and Moks, „Gene Fusions For Purposes of 
15 Expression, An Introduction" in Methods in Enzymology 185 (1990) 129-143, Academic : 
Press, In regard to the production via the "inclusion body" way, Uhlen and Moks refer to 
large fusion products involving fusion parts like trpE, cll and again G-galactosidase. Ningyi, 
L., et ah, Gaojishu Tongxun 10 (2000) 28-31 describe the recombinant expression of p24 
gag gene in E.coli. 

20 It was therefore the object of the present invention to provide a method which enables the 
recombinant production of a high yield of antifiisogenic peptides via the inclusion body 
route and which is suitable for the large-scale industrial production of such peptides. 

Summary of the Invention 

The invention therefore provides a process for the production of an antifusogenic peptide 
25 as a fusion peptide of a length of about 14 to 70 amino acids in a prokaryotic host cell, 
characterized in that, under such conditions that inclusion bodies of said fusion peptide are 
formed, 

a) in said host cell there is expressed a nucleic acid encoding said fusion peptide 
consisting of said antifusogenic peptide of a length of about 10 to 50 amino acids 

30 N- terminally linked to a further peptide of a length of about 4 to 30 amino acids; 

b) said host cell is cultivated; 
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c) said inclusion bodies are formed, recovered and solubilized; 

d) said fusion peptide is isolated. 

Preferably said antifusogenic peptide is cleaved of from said further peptide during or 
5 after solubilization of the inclusion bodies. 

Preferably, the antifusogenic peptide is a fragment of the C helix of a transmembrane 
subunit of an envelope fusion protein from a virus of the lentivirus genus. 

Said further peptide, if expressed recombinantly in said host cell as a non-fiision peptide, 
would not be found in the form of inclusion bodies since it is very short. 

10 In a preferred embodiment of the invention, the further peptide consists of a first and/or 
second part, 

(a) wherein said first part is a stretch of from 1 to 20 amino acids, preferably hydrophilic 
amino acids influencing the isoelectric point of the fusion peptide, and/or said first 
part differing significantly from the antifusogenic peptide with regard to solubility, 

15 interactions with chromatographic separation resins and/or improves access of 

cleavage proteases (Polyak, S.W., et al., Protein Eng. 10 (1997) 615-619) and 

(b) wherein said second part is a cleavable peptidic linker region from 1 to 10 amino 
acids (see, e.g., Table 3) and is located adjacent to the N-terminus of the fiisogenic 
peptide and the C-terminus of the first part. 

20 

If the further peptide consists only of the first or the second part, said part has a length of at 
least 4 amino acids (including start codon coded methionine) preferably for the purpose of 
stabilizing during expression mRNA. Said 4 amino acids include preferably at least one 
arginine. 

25 

In a further embodiment of the invention, the antifusogenic peptide contains a glycine 
attached to its C-terminus. This glycine is useful for the purpose of enzymatic C-terminal 
amidation. 

In a preferred embodiment of the invention, the ratio of the molecular weight of the anti- 
30 fusogenic peptide and the molecular weight of the further peptide in the fusion peptide is 
10 : 1 to 1 : 2. 
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Further preferred embodiments of the invention comprise a prokaryotic expression vector 
containing a nucleic acid encoding a fusion peptide according to the invention and its use 
for the recombinant production of said peptides. 

Preferred embodiments of the invention comprises a preparation of inclusion bodies of a 
5 fusion peptide according to the invention and methods for the production of such 
inclusion bodies. 

A further preferred embodiment of the invention comprises a nucleic acid encoding a 
fusion peptide according to the invention. 

Detailed Description of the Invention 

10 

It was surprisingly found that short-chain antifiisogenic peptides (preferably of a length of 
10 to 50, more preferably of a length of 10 to 40 amino acids) can be expressed successfully 
as short fusion peptides of a length of up to 70 amino acids in prokaryotes such as £.coIi via 
the inclusion body route. 

15 According to the invention, a fusion peptide consisting of an antifusogenic peptide 
N-terminally linked to a further peptide is overexpressed in prokaryotes under conditions, 
whereby insoluble protein inclusion bodies are formed. Inclusion bodies are found in the 
cytoplasm if an expression vector which does not contain a signal sequence is used, which . 
otherwise might enable soluble secretion of the protein into the periplasm or the medium. 

20 These inclusion bodies are separated from other cell components, for example by centrifii- 
gation after cell lysis. The inclusion bodies are solubilized by denaturing agents such as 
guanidine hydrochloride, urea, substances such as arginine, or strong bases such as KOH or 
NaOH. After solubilization, such proteins usually have to be refolded by dilution or by 
dialysis. As the fusion peptide produced according to the method of the invention is a 

25 short-chain peptide without disulfide bridges, renaturation is not necessary. After solubili- 
zation, the solution conditions, such as the pH, etc. are simply modified in such a way that 
cleavage of the further peptide is possible, and cleavage is performed if requested. The 
fusion peptide or antifusogenic peptide can then be recovered from this solution in a 
simple fashion, for example by size exclusion chromatography, ion exchange chromator- 

30 graphy or reversed phase chromatography. 

"Antiftisogenic" and "anti-membrane fusion" as used herein refer to a peptide's ability to 
inhibit or reduce the level of fusion events between two or more structures, e.g., cell 
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membranes or viral envelopes or pili relative to the level of membrane fusion which occurs 
between the structures in the absence of the peptide. Examples hereof are peptidic inhibi- 
tors of Antiviruses such as human immunodeficiency, virus (HIV), respiratory syncytical 
virus (RSV), human parainfluenza virus (HPV), measles virus (MEV), and Simian immu- 
5 nodeficiency virus (SIV). Such antifiisogenic peptides are derived from C helix of a trans- 
membrane subunit of an envelope fusion protein from a virus of the lentivirus genus and 
bind to the central coiled coil of the transmembrane subunit of the respective virus. 

Especially preferred are HIV-1 antifusogenic peptides. Table 1 describes examples of HIV- 1 
antifasogenic peptides derived from the C-peptide of gp41. These antifusogenic peptides 
10 and fragments thereof are particularly useful in the invention. 



Table 1 



Name* 


Name 


Amino acid sequence (one-letter code) 


SEQIDNO: 


T-1249 


T1357" 


WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASIiWEWF 


7 


T-20. DP178 




YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 


8 


T-118 


RSVllS'^ 


FDASISQVNEKINQSLAFIRKSDELLHNVNAGKST 


9 


T-257 


MV257'* 


LHRIDLGPPI SLERLDVGTNLGNAI AKLEDAKELL 


10 



* for N-terminally acetylated and/or C-terminally amidated peptide; T-20 (synonymous, 
with DP178) and T-1249 are from human immunodeficiency virus type 1 (HIV-1), T- 
15 118 is from respiratory syncytial virus (RSV) and T-257 is from measles virus (MV) 

WO 99/59615 

Rimsky, LT., et at, J. Virol 72 (1998) 986-993 
^> Lambert. D.M., Proc.Nad. Acad. Sci. USA 93 (1996)2186-2191 

20 The further peptide according to the invention is a peptide which is added N-terminally to 
the antifusogenic peptide not for purposes of improving the formation of inclusion bodies. 
Its purpose is to, for example, improve expression mRNA stabilization, purification (e.g. a 
His-tag; see, e.g., Zhang, J.-H., et al., Nanjing Daxue Xuebao, Ziran Kexue 36(4) (2000) 
515-517) or to allow subsequent N-terminal modification like acetylation or PEGylation. 

25 The ftirther peptide according to the invention is a short peptidic stretch consisting of at 
least four amino acids (methionine and three further amino acids for cleavage mRNA 
stabilization and/or expression purposes) to about 30 amino acids preferably to about 20 
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amino acids (with regard to the nucleic acid codons level). The length of the further 
peptide is not critical for the invention as long as this peptide is a peptide which would be 
formed in soluble form in the cytoplasm during recombinant expression in prokaryotes 
under conditions where large proteins such as immunoglobulins, streptavidin or tissue- 

5 type plasminogen activator are produced as inclusion bodies. However, it is preferred that 
the further peptide is very short for improving the yield of the antifusogenic peptide. (If the 
ratio of length of further peptide to antifusogenic peptide is 3:1, the amount of 
antifusogenic peptide in relation to the amount of the fusion peptide recovered is about 
50% lower than if the ratio is 1 : 1 if the same amount of fusion peptide is produced.) The 

10 length of the further peptide is selected such that the total length of the fusion peptide does 
not exceed about 70 amino acids, preferably 50 and most preferably 40 amino acids, for 
reasons of obtaining acceptable yields of antifusogenic peptides. Therefore, it is especially 
preferred that the further peptide consists only of an appropriate cleavage site, of some 
amino acids for improving expression and solubility of the fusion peptide, faciUtating the 

15 solubilization process of its inclusion bodies or to improve access of deavage proteases 
(avoidance of steric hindrance) and/or of some amino acids such as a His-tag for 
purification means (Hengen, P., Trends Biochem. Sci. 20 (1995) 285-286) and methionine 
necessary and encoded by the start codon. 

The further peptide is a short-chain peptide which, if overexpressed alone in prokaryotes 
20 such as E.coli and without a signal sequence, would not be formed as inclusion bodies but 
remains soluble in the cytoplasm or is rapidly degraded in the cytoplasm. Such proteins do 
not form a fbced denatured tertiary structure, therefore they cannot form a fixed tertiary 
structure which is poorly soluble, therefore remain soluble and would be degraded by E.coli 
proteases if expressed. 

25 The further peptide according to the invention consists preferably of amino acids which do 
not form a fixed tertiary conformation which might sterically hinder access of the cleavage 
agent to the cleavage site between the fusogenic peptide and the further peptide. For this 
reason, the further peptide is preferably free of cysteine residues. Cysteine contains a 
sulfhydryl or thiol group which is highly reactive and can form disulfide bonds. The 

30 presence of cysteine can work against the desired lack of fixed secondary or tertiary 
conformation of the further peptide and therefore its use is avoided. 

In a preferred embodiment of the present invention, the N-terminally attached further 
peptide stretch contains a sequence at its C-terminus, which is cleavable easily by enzymatic 
or chemical means. 
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The further peptide according to the invention is used in the fusion peptide also preferably 
for protection of the N-terminus of the antifiisogenic peptide during expression, solubili- 
zation, purification and peptide modification. Such fusion peptides are especially valuable 
in a process for the production of N-terminally modified antifiisogenic peptides. Such a 

5 method involves forming the recombinant polypeptide as a fusion peptide, which fusion 
part protects the N-terminus. The recombinant fusion peptide can then be reacted with up 
to three chemical protecting agents to selectively protect reactive side-chain groups and 
thereby prevent side-chain groups from being modified. Then the fusion peptide is cleaved 
with at least one cleavage reagent between the peptide and the fusion part to form an 

10 unprotected terminal amino acid reactive group. The unprotected terminal amino acid 
reactive group is modified with at least one chemical modifying agent such as acetic 
anhydride or acetic N-hydroxysuccinimide ester for N-terminal acetylation. The side- 
chains are then deprotected to form a N-terminally modified peptide. Such methods are 
described, for example, in WO 94/01451; U.S. Patent No. 5,635,371; and U.S. Patent No. 

15 5,656,456. 

The further peptide part preferably has such a structure that it facilitates the purification of 
the fusion peptide. The fiirther peptide preferably contains for this purpose an "affinity tag" 
(cf. Pandjaitan, B., et al, Gene 237 (1999) 333-342), such as polyhistidine (about 6 His 
residues) or the like. 

20 It is preferred to select the further peptide in such a way that its isoelectric point (IP) differs 
from the IP of the antifiisogenic peptide for easy expression and separation of the 
fi-agments after cleavage. Table 2 shows IP's of different ftision peptides and the IP's of 
non-fiision fiisogenic peptides T680 and T1357 (nomenclature according to WO 96/40191 
and WO 99/59615). Preferably, the IP of the fiision peptide and the antifiisogenic portion 

25 of the fusion peptide differ by about one pH unit preferably by about 1 to 2 pH units. Such 
IP shift can preferably performed by basic amino acids and/or histidins contained in the 
fiirther peptide. 
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Table 2 

Calculating the isoelectric points of antifiisogenic peptides 



SEQIDNO: 


Sequence^* 


Calculated 


8 


T680 


4.13 


15 


M-HHHHHH-IEGR-r5«0-G 


6.13 




T1249 


5.1" 


7 


T1357 


4.53 


14 


M-HHHHHH-IEGR-T2357-G 


6.23 


2 


MRGS-HHHHHH-AIVD-IEGR-ri357-G 


6.23 



5 Experimentally determined. 

Amino acids in one-letter code; inverse parts are peptide names. 

In a further preferred embodiment of the invention, the antifiisogenic peptide contains a 
glycine at its C- terminus. This glycine is useful for the purpose of subsequent enzymatic C- 
terminal amidation (Bradbury, A.F., and Smyth, D.G., Trends Biochem. Sci. 16 (1991) 112- 
10 115). 

Especially preferred fusion peptides according to the invention are (amino acids in 
standard one-letter code): 



MRGS-HHHHHH-AIDV-IEGR-rJ357-G, (SEQ ID NO:2) 

MRGS-HHHHHH-AIDV-IEGR-RSV]J«-G (SEQ ID NO:ll) 

15 MRGS-HHHHHH-AIDV.IEGR-MV257-G (SEQ ID NO:12) 

M.HHHHHH-AIDV-IEGR-T580-G, (SEQ ID NO: 13) 

M-HHHHHH-IEGR- TJ357-G (SEQ ID NO:14) 



said fusion peptides without the poly HIS tag (His 6), or with the first four amino acids of 
20 SEQ ID NO:2, 1 1 or 12 as further peptide in the fusion peptides. 

Annotation: The inverse parts are peptide names, and therefore the letters included in these 
parts do not constitute one-letter codes for amino acids. 

25 There exist a large number of publications which describe the recombinant production of 
proteins in prokaryotes via the inclusion body route. Examples of such publications are 
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Misawa, S., et al., Biopolymers 51 (1999) 297-307; Lilie, H., Curr. Opin. Biotechnol. 9 
(1998) 497-501; Hockney, R.C., Trends Biotechnol. 12 ( 1994) 456-463. 

The fusion peptides according to the invention are overexpressed in prokaryotes. Overex- 
pression without a signal peptide leads to the formation of inclusion bodies. Methionine 
5 coded by the start codon and mentioned in the examples above can be cleaved during 
further processing- General methods for overexpression of proteins in prokaryotes have 
been well-known in the state of the art for a long time. Examples of publications in the field 
are Skelly, J.V., et al., Methods Mol. Biol. 56 (1996) 23-53 and Das, A., Methods EnzymoL 
182(1990) 93-112. 

10 Overexpression in prokaryotes means expression using optimized expression cassettes with 
promoters such as the tac or lac promoter (EP-B 0 067 540). Usually, this can be performed 
by the use of vectors containing chemical inducible promoters or promoters inducible via 
shift of temperature. One of the useful promoters for E.coli is the temperature-sensitive 
XPL promoter (cf. EP-B 0 041 767). A further efficient promoter is the tac promoter (cf. 

15 U.S. Patent No. 4.551,433). Such strong regulation signals for prokaryotes such E.coli 
usually originate from bacteria-challenging bacteriophages (see Lanzer, M., et al., Proc. 
Natl. Acad. Sci. USA 85 (1988) 8973-8977; Knaus, R., and Bujard, H., EMBO Journal 7 
(1988) 2919-2923; for the kT7 promoter Studier, F.W., et al„ Methods EnzymoL 185 
(1990) 60-89); for the T5 promoter EP 0 186 069). 

20 By the use of such overproducing prokaryotic cell expression systems the fusion peptides 
according to the invention are produced at levels at least comprising 10% of the total 
expressed protein of the cell, and typically 30-40%. and occasionally as high as 50%. 

"Inclusion bodies" ((IBs) as used herein refers to an insoluble form of polypeptides recom- 
binantly produced after overexpression of the encoding nucleic acid in prokaryotes. This 
25 phenomenon is widely known in the state of the art and is reviewed, for example, by 
Misawa S., and Kumagai, I., Biopolymers 51 (1999) 297-307); Guise, A.D., et al., Mol 
Biotechnol. 6 (1996) 53-64; and Hockney et al.. Trends Biotechnol. 12 (1994) 456-463. 

Solubilization of the inclusion bodies is preferably performed by adding denaturing agents 
such as urea, guanidine hydrochloride or alkaline solutions of KOH or NaOH (Guise, A.D., 
30 et al, Mol. Biotechnol. 6 (1996) 53-64; Fischer, B., et al., Arzneimittelforschung 42 (1992) 
1512-1515). 
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The fusion peptides according to the invention can be cleaved enzymatically after solubili- 
zation with a specifically cleaving protease (restriction protease). The proteinase is selected, 
taking into consideration the amino acid sequence of the antifusogenic peptide to be 
produced. Care must be taken that> if possible, the recognition/cleavage sequence of the 
5 restriction proteinase does not occur in the antifusogenic peptide, and preferably also not 
in the further peptide, i.e., it should only occur once in the cleavage region (linker region). 
Suitable specifically cleaving endoproteinase are, for example, Factor Xa, thrombin, 
subtilisin, BTN variant/ubiquitin protease peptidase, rennin, coUagenase, trypsin, chymo- 
trypsin, endoproteinase Lys-C, kallikrein (Carter, P.: In: Ladisch, M.R.; Willson, R.C.; 
10 Painton, C.C.; Builder, S.E., eds., Protein Purification: From Molecular Mechanisms to 
Large-Scale Processes; ACS Symposium Series No. 427, American Chemical Society, pp. 
181-193 (1990)), TEV proteinase (Parks, T.D., et al., Anal Biochem. 216 (1994) 413-417), 
IgA protease (Pohlner, J., et al., Nature 325 (1987) 458-462), Kex2p proteinase 
(EP-A 0 467 839) or S. aureus V8 proteinase. 

15 In addition to being cysteine-free, the sequence of the further peptide may preferably 
exploit other design strategies which promote efficient cleavage in the preselected cleavage 
environment. Particularly if the preselected cleavage agent is an endopeptidase, it is 
preferred that the further peptide is soluble in aqueous environments. Amino acids having 
charged side-groups and hydrophilic properties are, therefore, preferably included in the 

20 further peptide to promote solubility or any other amino acids which promote the access of 
cleavage proteases. Such amino acids are, for example, Glu and Asp (anionic), Arg and Lys, 
and Ser and Thr (neutral, hydrophilic). If arginine is used, it must be taken into account 
that arginine constitutes the trypsin cleavage site. Therefore, in such cases where the further 
peptide should contain arginine, trypsin should be avoided as cleaving protease. The use of 

25 lysine, too, is subject to limitations. If the peptide has to be N-terminally modified (see 
above), there may be a need for protecting lysine groups. Therefore, in such cases, it is 
preferred to avoid lysine in the further peptide. 

The cleavage site is typically selected so that it is not contained in the antifusogenic peptide 
and is preferably contained in the further peptide. Chemical and enzymatic cleavage sites 
30 and the corresponding agents used to effect cleavage of a peptide bond close to one of the 
sites are described, for example, in WO 92/01707 and WO 95/03405. 
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Examples for cleavage enzymes and the cleavage sequence are described in Table 3 below: 



Table 3 



Pnzvme 


Cleavage sequence 


SEQIDNO: 


Enterokinase 


DDDDK 


16 


Factor Xa 


lEGR 


17 


Thrombin 


RGPR 


18 


Ubiquitin 


RGG 




Rennin 


HPFHL-LVY 


19 


Trypsin 


DorR 




Chymotrypsin 


ForYorW 




Clostripain 


R 




S. aureus V8 


G 





5 Chemical cleavage: 



Cleavage substance 


Cleavage sequence^^ 


BrCN 


M 


BNPS-skatole 


W 


5-Nitro-5- 
thiocyanobenzoate 


C 



Amino acid(s) in one-letter code. 

Trypsin is preferably used, which specifically cleaves proteins and peptides at the 
C-terminal end of arginine. Such an enzyme is known, for example, fi-om porcine or bovine 
pancreas or recombinant yeasts. Trypsin is particularly suitable for producing the desired 
10 polypeptides, if the lysine residues are protected and attacked by the enzyme. 

The peptide sequence which can be cleaved by an endoproteinase is understood within the 
sense of the present invention as a short-chain peptide sequence which is preferably 
composed of 1 to 20 amino acids preferably 1 to 10 amino acids and contains a C-terminal 
cleavage site for the desired endoproteinase. This further peptide preferably additionally 
15 contains a combination of several amino acids (first part) between the N-terminal end and 
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the desired endoproteinase recognition sequence, preferably selected from amino acids Gly, 
Thr, Ser, Ala, Pro, Asp, Glu, Arg and Lys. An amino acid stretch in which two to eight of 
these additional amino acids are the negatively charged amino acids Asp and/or Glu is 
preferably used as the first part. 

5 Cleavage is also possible using BrCN (chemical cleavage) as long as the antifiisogenic 
peptide does not contain methionine. 

The fusion peptides are produced by expression of a DNA (nucleic acid sequence) which 
codes for the peptide in prokaryotes. The expression vector does not contain any elements 
that would mediate secretion of the fusion peptide into the medium or periplasm (e.g., 
10 signal peptides). Therefore, the peptide is formed as insoluble refractile bodies (IP's). 

DNA encoding the fusion peptide can be produced according to the methods known in the 
state of the art. It is further preferred to extend the nucleic acid sequence with additional 
regulation and transcription elements, in order to optimize the expression in the host cells. 
A DNA that is suitable for the expression can preferably be produced by synthesis. Such 
15 processes are familiar to a person skilled in the art and are described for example in Beattie, 
K. L., and Fowler, R. F., Nature 352 (1991) 548-549; EP-B 0 424 990; Itakura, K.> et al., 
Science 198 (1977) 1056-1063. It may also be expedient to modify the nucleic acid sequence 
of the peptides according to the invention. 

Such modifications are, for example: 

20 - modification of the nucleic acid sequence in order to introduce various recognition 
sequences of restriction enzymes to facUitate the steps of ligation, cloning and 
mutagenesis; 

- modification of the nucleic acid sequence to incorporate preferred codons for the host 

cell; 

25 - extension of the nucleic acid sequence with additional regulation and transcription 
elements in order to optimize the expression in the host cell. 

All further steps in the process for the production of suitable expression vectors and for the 
expression are state of the art and familiar to a person skilled in the art. Such methods are 
described for example in Sambrook et al., Molecular Cloning: A Laboratory Manual 
30 ( 1 989), Cold Spring Harbor Laboratory Press, New York, USA. 
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As the fusion peptide is not secreted, it aggregates in the cell, preferably in the cytoplasm. 
Here the peptide is stored in a compressed and insoluble form (inclusion bodies). This 
reduces interference with cell functions like proteolytic degradation to a minimum. 

Escherichia coli, Streptomyces or Bacillus are for example suitable as prokaryotic host 
5 organisms. For the production of the fusion peptides according to the invention the 
prokaryotes are transformed in the usual manner with the vector which contains the DNA 
coding for the peptide and subsequently fermented in the usual manner. After lysis of the 
cells the insoluble inactive peptide (IBs) is isolated in the usual manner for example by 
centrifiigation (pellet fraction). The desired insoluble peptide aggregates can if necessary be 
10 further enriched by washing the pellets e.g. with buffers containing detergents. 

The insoluble fusion peptide is solubilized preferably with alkaline solutions (e.g., KOH, 
pH 10) and cleaved by appropriate means, $.g., &ctor Xa at pH 8.0. 

Surprisingly it has turned out that the fusion peptides produced by the process according to 
the invention are not degraded in the host cells formed as inclusion bodies and, 
15 subsequently, can be completely cleaved enzymatically without significant cleavage in the 
antifusogenic peptide component itself. 

The following examples, references, Fig. 1 and the sequence listing are provided to aid the 
understanding of the present invention, the true scope of which is set forth in the appended 
claims. It is understood that modifications can be made in the procedures set forth without 
20 departing from the spirit of the invention. 

Fi gure 1 shows the expression vector encoding Xa-T 1357. 

Sequence Listing 

SEQ ID NO:l Synthetic gene Xa-T 1357. 
SEQ ID NO:2 Peptide Xa-Tl357. 
25 SEQIDN0:3 Primer 1. 



SEQIDN0:4 Primer 2. 
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SEQ ID N0:5 Primer 3. 

SEQIDN0:6 Primer 4. 

SEQIDNO:? Peptide T1357. 

SEQIDNO:8 Peptide T680. 

5 SEQIDN0:9 Peptide RSVl 18. 

SEQIDNO:10 Peptide MV257. 

SEQ ID N0:11 Peptide Xa-RSVl 18. 

SEQIDN0:12 Peptide Xa-MV257. 

SEQ ID N0:13 Peptide M-HHHHHH-AIDV-IEGR-T1357-G. 

10 SEQIDN0:14 Peptide M-HHHHHH-IEGR-T1357-G. 

SEQ ID N0:15 Peptide M-HHHHHH-IEGR-T680-G. 

SEQ ID NO: 16 Cleavage sequence. 

SEQIDN0:17 Cleavage sequence. 

SEQ ID NO: 18 Cleavage sequence. 

1 5 SEQ ID NO: 19 Cleavage sequence. 
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Expression vector construction 

The synthetic gene Xa-T1357 has the following structure (from the N-terminus to the 
C-terminus): 

5 

- EcoRI cleavage site 

- ribosome binding site of phage T5 

- amino acids M, R, G, S 

- amino acids 6 x H (His-tag) 
10 - amino acids A, I, D, V 

- factor Xa linker I, E, G, R 

- T1357 sequence (39 amino acids) 

- amino acid G 

- 2 stop codons 

15 - BamHI cleavage site 

and is described by SEQ ID NO: 1 . 

The synthetic gene was constructed via gene synthesis with the use of four oligonucleotides 
20 consisting of the sequence of the gene with three overlapping regions of 20, 21 and 20 base 
pairs. Synthesis was performed by means of a two-step PGR reaction. In the first step, oligo 
1 and 2 and oligo 3 and 4 each were applied as a template for complete synthesis of the N- 
terminal portion of the gene and the C-terminal portion of the gene, respectively. The 
products were used as a template for the second step, whereby equal portions of these 
25 products were applied. Primers 1 and 4 were used as the synthesis primers for this step. 

The resulting synthetic gene was digested with EcoRI and BamHI, and so was the vector 
pQE-30 (Qiagen, DE). Both restriction digestions were purified by means of gel extraction 
(Qiagen) and then used for ligation for the production of an expression vector. The ratio of 
the insert to the vector was 3:1. The vector is shown in Fig. 1. 

30 The sequence and the correct orientation of the construct were determined by means of 
restriction control and sequencing (SequiServe, DE). 
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Example 2 

Fermentation, IB Isolation and Solubilization 
Fermentation and IB isolation: 

Exoli cells containing the expression vector of Example 1 are fermented in complex 
5 medium, with selection using ampicillin and kanamycin. LB medium is used for the pre- 
culture. For the main culture, yeast extract is used as the only complex component as the 
C- and N-source, and glycerine is used as an additional C-source. Both components are 
added in concentrated form during the fed-batch phase. Expression with IPTG is induced 
in a suitable growth phase. The pH value is adjusted to 7 ± 0.2 using phosphoric 
10 acid/caustic soda. Fermentation is conducted with an overpressure of up to 0.5 bar. The 
oxygen partial pressure is controlled via the rotational speed of the stirrer, aeration and/or 
rate of dosage. After growth arrest and after a corresponding period of expression, the 
biomass is harvested, immediately or after cooling, by centrifugation and is stored in a 
frozen state. 

15 The foreign protein to be expressed is obtained intracellularly in the form of inclusion 
bodies (IB), an IB preparation is conducted prior to purification. To this end, the biomass 
is disrupted with the use of a homogenizer and the IB's are purified in several washing and 
centrifugation steps. 

Solubilization 

20 5.21 g of IB's (prepared from 20 g of cells by high-pressure disruption in 100 ml disruption 
buffer (100 mM Tris-HCl, 10 mM EDTA, pH 7.0)) are dissolved in 206 ml of 30 mM KOH 
pH 12 at room temperature, while stirring. 

Protein content solubilisate: c = 3.2 mg/ml (Bradford protein assay, Bradford M.M., 
Analytical Biochemistry 72 (1976) 248-254)-^ 659.2 mg total protein. 
25 Molecular weight of the hision peptide of: MW = 7161.96 (7156.05 monoisotopic) 
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The peptides 

MRGS-HHHHHH-AIDV-IEGR-/ISW18-G (SEQ ID NO:ll) 

MRGS-HHHHHH-AIDV-IEGR-MV257-G (SEQ ID NO:12) 

5 are produced in the same manner, whereby the sequences RSV 1 18 and MV 257 (see 
Table 1) are used instead of the sequence "T 1357". 

Example? 

Citraconylation and cleavage 

a) Citraconylation 

For citraconylation there is applied to the fusion peptide a mixture of citraconic acid 
anhydride (MW: 112.09, p = 1.245 g/ml) and dioxane at a ratio of 1:1, whereby the 
citraconic acid anhydride is used at a 12-fold excess per mole of amino group. For 
citraconylation 0,66 g fusion peptide (MW: 7162, C = 3.2 mg/ml in disruption buffer of 
Example 2b), five amino groups/molecule and 570^1 citraconic acid anhydride are 
required. The solubilisate is adjusted to a pH of 11 by using concentrated HCl. 
Subsequently, a mixture of 570 |il citraconic acid anhydride and 570 ^1 dioxane is added 
dropwise and the solution is buffered with 2 M KOH so that the pH value does not drop 
below 8.5. After adjusting the pH value to pH 10, the preparation is incubated overnight at 
room temperature, while stirring. The reaction is stopped with ethanol amine (21 ml of 
1 M ethanol amine in H2O, pH 8.0) (final concentration of ethanol amine in the solution: 
100 mM) and the pH value is adjusted to pH 8.5 by using 2 M HCl. The citraconylated 
material is then stored at -20**C. 

Total volume of reaction batch: 237 ml. 

b) Cleavage with trypsin 

25 The reaction batch is mixed with 11.85 ml of 1 M NaCl (final concentration 50 mM) and 
with 1.25 ml of 1 M CaCb (final concentration 2 mM) and batches thereof of 10 ml each 
are incubated with 1.3 U/ml of trypsin for 80 minutes at room temperature, while stirring. 
At a level of trypsin of 1.3 U/ml, complete cleavage of the fusion protein has taken place. 
Cleavage is determined by means of 16%-tricin-SDS gels and HPLC analysis. The trypsin 



10 



15 



20 
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applied is trypsin from Roche Diagnostics GmbH (degree of purity II, 0,0091% 
chymotrypsin activity, activity. 206 U/ml). 

c) Determination of cleavage by HPLC analysis 

The portion of peptide is determined by HPLC with a linear calibration curve, which was 
5 prepared with synthetic Ac-Tl357-G-OH. The following HPLC system is used: 

Column: Pharmacia Source 15RPC ST 4.6/100 

Eluent: buffer A: 20 mM Tris, pH 7.5; buffer B: 70% acetonitrile + 30% buffer A 

Gradient: in 28 minutes from 0% B to 100% B 

Flow: 1 ml/min. 

10 Detection: UV226nm 

lBxampk4 

Purification 

a) Ammonium sulfate precipitation 

To the reaction mixture of example 3b solid ammonium sulfate (AS) (final concentration 
15 2 M) is added and, after dissolving the AS, is stirred at room temperature. The precipitated 
peptide is removed by centrifugation (10,000 rpm) for 5 minutes at 4**C and the pellet^ 
obtained (containing peptide and trypsin) is dissolved in 5 ml Tris buffer (50 mM, pH 8.5). 

b) Benzamidine Sepharose 6B chromatography 

Removal of trypsin from the above-described solution is accomplished by means of a 
20 benzamidine column. 

Application: 5 mg of protein 

Column: HR 16 (Pharmacia, h = 3.3 cm, d - 1.6 cm, V = 10 ml) 

Material: Benzamidine Sepharose 6B (Pharmacia No. 17-0568-01), regenerated according 
to the manufacturer's instructions and equilibrated with 50 mM Tris buffer, pH 8.5 
25 Flow: 1 ml/min. 

Washing the column: 50 mM Tris buffer, pH. 8.5 
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40 to 80% of the peptide flows through the column, the rest binds to the column material. 
In the subsequent elution with 60% of ethanol, a possibly present co-elution of the trypsin 
cannot be determined because trypsin is no longer detectable when the content of ethanol 
is 20% or higher. Both in the flow-through and in the elution, the peptide appears as a 
5 single peak in the HPLC analysis, and the peptide is also homogeneous in the SDS gel. 

c) Purification via Phenyl SepharoseFF 

Phenyl Sepharose FastFlow (Pharmacia): 5 mg of the pellet precipitate with AS (protein 
content c = 5 mg/ml) are dissolved in 1 ml Tris buffer (50 mM Tris pH 8.5) and applied to 
10 the column. The peptide binds completely to the column material and can be eluted with 
an ethanol gradient of 20 to 60%, in 50 mM Tris HCl, pH 8.5 the elution taking place over 
a long period of retention. Peptide fractions are homogeneous. 
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Patent Claims 



1. A process for the production of an antifusogenic peptide as a fusion peptide of a 
length of about 14 to 70 amino acids in a prokaryotic host cell, characterized in that, 
under such conditions that inclusion bodies of said fusion peptide are formed, 

5 a) in said host cell there is expressed a nucleic acid encoding said fusion peptide 

consisting of said antifusogenic peptide of a length of about 10 to 50 amino 
acids N-terminally linked to a further peptide of a length of about 4 to 30 
amino acids; 
b) said host cell is cultivated; 
10 c) said inclusion bodies are formed, recovered and solubilized; 

d) said fusion peptide is isolated. 

2. Process according to claim 1, characterized in that the further peptide consists of a 
first and/or second part, 

15 (a) wherein said first part is a stretch of up to 20 amino acids, and 

(b) wherein said second part is a cleavable peptidic linker region up to 10 amino 
acids and is located adjacent to the N-terminus of the fusogenic peptide and the 
C-terminus of the first part. 

3. Process according to claims 1 to 2, characterized in that the ratio of the molecular 
20 weight of the antifusogenic peptide and the molecular weight of the second peptide is 

10 : 1 to 1 : 2. 

4. Process according to claims 1 to 3 characterized in that said antifusogenic peptide is 
cleaved firom said further peptide during or after solubilization of said inclusion 
bodies. 



25 5. A nucleic acid encoding a fusion peptide of a length of about 14 to 70 amino acids 
consisting of a antifusogenic peptide of a length of about 10 to 50 amino acids N- 
terminally linked to a further peptide with a length of 4 to 30 amino acids. 

6. A prokaryotic expression vector containing a nucleic acid encoding an antifusogenic 
peptide of a length of about 10 to 50 amino acids or a fusion peptide of a length of 
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about 14 to 70 amino acids consisting of said antifiisogenic peptide N-terminally 
linked to a further peptide with a length of 4 to 30 amino acids. 

A preparation of inclusion bodies of an antifiisogenic peptide of a length of about 10 
to SO amino acids or a fusion peptide of a length of about 14 to 70 amino acids N- 
terminally linked to a further peptide with a length of 4 to 30 amino acids. 
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SEQUENCE LISTING 

<110> F. HOFFMANN-LA ROCHE 

5 

<120> Methods for the recombinant production of peptidic 
antiviral fusion inhibitors 

<130> Case 20904 

10 

<140> 
<141> 

<160> 22 

15 

<170> Patentin Ver. 2.1 

<210> 1 

<211> 221 

20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic gene 
25 Xa-T 1357 

<220> 

<221> CDS 

<222> (32) . . (211) 

30 

<400> 1 

cgaggaattc attaaagagg agaaattaac t atg aga gga teg cat cat cat 52 

Met Arg Gly Ser His His His 
1 5 

35 

cat cat cat get ate gat gtt att gaa ggc cgt tgg cag gaa tgg gaa 100 
His His His Ala He Asp Val He Glu Gly Arg Trp Gin Glu Trp Glu 
10 15 20 

40 cag aaa att acc gcc ctg ctg gaa cag gcg caa att cag caa gag aaa 148 
Gin Lys He Thr Ala Leu Leu Glu Gin Ala Gin He Gin Gin Glu Lys 
25 30 35 

aac gaa tat gag ctg cag aaa ctg gat aag tgg gcg age ctg tgg gaa 196 
45 Asn Glu Tyr Glu Leu Gin Lys Leu Asp Lys Trp Ala Ser Leu Trp Glu 
40 45 50 55 



tgg ttc ggc taa tga ggatccagct 221 
Trp Phe Gly 
50 60 
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<210> 2 
<211> 58 
<212> PRT 
5 <213> Artificial Sequence 

<223> Description of Artificial Sequence: synthetic gene 
Xa-T 1357 

<400> 2 

10 Met Arg Gly Ser His His His His His His Ala lie Asp Val lie Glu 
15 10 15 

Gly Arg Trp Gin Glu Trp Glu Gin Lys He Thr Ala Leu Leu Glu Gin 

20 25 30 

Ala Gin He Gin Gin Glu Lys Asn Glu Tyr Glu Leu Gin Lys Leu Asp 
15 35 40 45 

Lys Trp Ala Ser Leu Trp Glu Trp Phe Gly 
50 55 



20 

<210> 3 

<211> 70 
<212> DNA 

<213> Artificial Sequence 

25 

<220> 

<223> Description of Artificial Sequence: primer 1 
<400> 3 

30 cgaggaattc attaaagagg agaaattaac tatgagagga tcgcatcatc atcatcatca 60 
tgctatcgat 70 



<210> 4 
35 <211> 70 
<212> DNA 

<213> Artificial Sequence 
<220> 

40 <223> Description of Artificial Sequence: primer 2 
<400> 4 

agcagggcgg taattttctg ttcccattcc tgccaacggc cttcaataac atcgatagca 60 
tgatgatgat 70 

45 

<210> 5 
<211> 71 
<212> DNA 
50 <213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer 3 
<400> 5 

5 acagaaaatt accgccctgc tggaacaggc gcaaattcag caagagaaaa acgaatatga 60 
gctgcagaaa c 71 



<210> 6 
10 <211> 71 
<212> DNA 

<213> Artificial Sequence 
<220> 

15 <223> Description of Artificial Sequence: primer 4 
<400> 6 

agctggatcc tcattagccg aaccattccc acaggctcgc ccacttatcc agtttctgca 60 
gctcatattc g 71 

20 

<210> 7 
<211> 39 
<212> PRT 
25 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence rpeptid T1357 
30 <400> 7 

Trp Gin Glu Trp Glu Gin Lys lie Thr Ala Leu Leu Glu Gin Ala Gin 

15 10 15 ■ 

lie Gin Gin Glu Lys Asn Glu Tyr Glu Leu Gin Lys Leu Asp Lys Trp 
35 20 25 30 

Ala Ser Leu Trp Glu Trp Phe 
35 

40 

<210> 8 
<211> 36 
<212> PRT 

<213> Artificial Sequence 

45 

<220> 

<223> Description of Artificial Sequence: peptide T680 



<400> 8 

50 Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin Gin 
15 10 15 
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Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu 
20 25 30 

5 Trp Asn Trp Phe 
35 



<210> 9 
10 <211> 35 

<212> PRT 

<213> Artificial Sequence 
<220> 

15 <223> Description of Artificial Sequence : peptide RSV118 
<400> 9 

Phe Asp Ala Ser lie Ser Gin Val Asn Glu Lys lie Asn Gin Ser Leu 
1 5 ^10 15 

20 

Ala Phe lie Arg Lys Ser Asp Glu Leu Leu His Asn Val Asn Ala Gly 
20 25 30 

Lys Ser Thr 
25 35 



<210> 10 
<211> 35 
30 <212> PRT 

<213> Artificial Sequence 



35 



<220> 

<223> Description of Artificial Sequence: peptide MV257 
<400> 10 

Leu His Arg lie Asp Leu Gly Pro Pro lie Ser Leu Glu Arg Leu Asp 
15 10 15 

40 Val Gly Thr Asn Leu Gly Asn Ala lie Ala Lys Leu Glu Asp Ala Lys 
20 25 30 

Glu Leu Leu 
35 

45 

<210> 11 
<211> 54 
<212> PRT 
50 <213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: pep tidie 
Xa-RSV118 

- 5 <400> 11 

Met Arg Gly Ser His His His His His His Ala He Asp Val He Glu 
15 10 15 

Gly Arg Phe Asp Ala Ser He Ser Gin Val Asn Glu Lys He Asn Gin 
10 20 25 30 

Ser Leu Ala Phe He Arg Lys Ser Asp Glu Leu Leu His Asn Val Asn 
35 40 45 

15 Ala Gly Lys Ser Thr Gly 
50 



<210> 12 

20 <211> 54 

<212> PRT 

<213> Artificial Sequence 
<220> 

25 <223> Description of Artificial Secjuence : peptide 
Xa-MV257 

<400> 12 

Met Arg Gly Ser His His His His His His Ala He Asp Val He Glu 
30 1 5 10 15 

Gly Arg Leu His Arg He Asp Leu Gly Pro Pro He Ser Leu Glu Arg 
20 25 30 

35 Leu Asp Val Gly Thr Asn Leu Gly Asn Ala He Ala Lys Leu Glu Asp 
35 40 45 

Ala Lys Glu Leu Leu Gly 
50 

40 

<210> 13 
<211> 52 
<212> PRT 
45 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : peptide 
M-HHHHHH-AIDV-IEGR-T1357-G 

50 

<400> 13 
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Met His His His His His His Ala He Asp Val He Glu Gly Arg Tyr 
1 5 10 15 

Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu 
5 20 25 30 

Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp 
35 40 45 

10 Asn Trp Phe Gly 
50 



<210> 14 
15 <211> 51 
<212> PRT 

<213> Artificial Sequence 

<220> 

20 <223> Description of Artificial Sequence: peptide 
M-HHHHHH-IEGR-T1357-G 

<400> 14 

Met His His His His His His He Glu Gly Arg Trp Gin Glu Trp Glu 
25 1 5 10 15 

Gin Lys He Thr Ala Leu Leu Glu Gin Ala Gin He Gin Gin Glu Lys 
20 25 30 

30 Asn Glu Tyr Glu Leu Gin Lys Leu Asp Lys Trp Ala Ser Leu Trp Glu 
35 40 45 

Trp Phe Gly 
50 

35 



<210> 15. 
<211> 48 
40 <212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: pep tide 
45 M-HHHHHH- IEGR-T68 0 -G 

<400> 15 

Met His His His His His His He Glu Gly Arg Tyr Thr Ser Leu He 
15 10 15 

50 
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His Ser Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin 
20 25 30 

Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Gly 
5 35 40 45 



10 

<210> 16 
<211> 42 
<212> PRT 

<213> Artificial Sequence 

15 

<220> 

<223> Description of Artificial Sequence : peptide 
MR-T1357-G 

20 <400> 16 

Met Arg Trp Gin Glu Trp Glu Gin Lys He Thr Ala Leu Leu Glu Gin 
15 10 15 

Ala Gin lie Gin Gin Glu Lys Asn Glu Tyr Glu Leu Gin Lys Leu Asp 
25 20 . 25 30 

Lys Trp Ala Ser Leu Trp Glu Trp Phe Gly 
35 40 

30 

<210> 17 
<211> 5 
<212> PRT 

<213> Artificial Sequence 

35 

<220> 

<223> Description of Artificial Sequence: cleavage 
sequence 

40 <400> 17 

Asp Asp Asp Asp Lys 
1 5 



45 <210> 18 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
50 <220> 

<223> Description of Artificial Sequence: cleavage 
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sequence 

<400> 18 
He Glu Gly Arg 
5 1 



<210> 19 
<211> 4 
10 <212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: cleavage 
15 sequence 

<400> 19 
Arg Gly Pro Arg 
1 

20 

<210> 20 
<211> 8 
<212> PRT 
25 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: cleavage 
sequence 



30 



<400> 20 

His Pro Phe His Leu Leu Val Tyr 
1 5 



35 
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